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Nucleotide Sequences of the Nucleocapsid (NP) and Phosphoprotein (P) Genes of a 
Malaysian Velogenic Newcastle Disease Virus Strain AF2240 and the Production of 
the NP and P Proteins in Escherichia coli 



Field of the Invention 

5 The present invention relates to nucleotide sequences encoding the nucleocapsid (NP) 

protein and phosphoprotein (P) of Newcastle disease virus (NDV) strain AF2240, and the 
production of the corresponding proteins with recombinant plasmids bearing the 
nucleotide sequences in Escherichia coli. 



1 0 Description of the Prior Art 

Newcastle disease virus (NDV) is the prototype of avian paramyxovirus, which causes a 
highly contagious disease known as Newcastle disease (ND) in many avian species. This 
disease is of great economic importance requiring control by vaccination or quarantine 
with slaughter of all birds in confirmed outbreaks, resulting in substantial losses in the 
15 poultry industry worldwide. Therefore, development of an improved vaccine and also a 
rapid and sensitive diagnostic test are greatly desired by the poultry industry. 

A Malaysian heat resistant NDV strain AF2240, which causes 100% mortality in 
susceptible chicken flocks has been reported by Abdul Rahman et al (1976) and Lai, 
CM. (1985). Further studies by Idris et al (1993) revealed that the thermostabilities of 
20 haemagglutination and neuraminidase activities of this AF2240 strain were found to be 
higher than those of other strains. The basis giving rise to these unique features is still 
unknown. However a comprehensive understanding of the viral proteins would provide 
some solutions and useful information for the development of heat stable recombinant 
vaccines and diagnostic tests. 

25 The genome of NDV is a linear, non-segmented, single-stny^^ RNA 
with a molecular weight of 5.2-5.7x1 0 6 Daltons, or apor^xtaK^^ which 
encodes six main structural proteins. The genomi^R^A^^^ the 



nucleocapsid (NP), phosphoprotein (P) and large (L^^rot^j^. T^sV macrofiiolecules 





form the transcriptive complex of the virus, which in turn is surrounded by a lipid bilayer 
membrane derived from the host cell. Embedded in the membrane are the 
haemagglutinin-neuraminidase (HN) and fusion (F) glycoproteins. Beneath the lipid 
bilayer is a shell of protein known as the matrix (M) protein, which is believed to interact 
5 with the transcriptive complex. The HN and F glycoproteins are associated with the host 

cell receptor during infection. The NP encapsidates the viral RNA together with the L 
protein which is thought to be the transcriptase, and a P protein with an unknown reason. 

The genes encoding for the HN (EMBL/Gen Bank/DDBJ accession No.X70092) 5 F 
(EMBL/Gen Bank/DDBJ accession No.AF048763) and M (EMBL/Gen Bank/DDBJ 

10 accession No. AF060563) proteins of the NDV strain AF2240 have been completely 
sequenced by Tan et al. (1995), Salih et al. (2000) and Jemain, S.F.P. (1999) respectively. 
From the HN gene sequence of strain AF2240, it was quite clear that this strain is 
different from the other published NDV strains. The HN protein lacked the Arg (403) 
residue and contained 581 amino acids. At the time when the project was initiated, there 

1 5 was no information available on the coding sequences for the NP and P proteins of NDV 
strain AF2240. Therefore it remained a problem to prepare cDNA for the cloning of the 
NP and P genes of NDV. 

The inventors have now successfully determined the nucleotide sequences encoding the 
NP and P proteins of NDV strain AF2240. The accession numbers for the genes encoding 

20 the NP and P proteins are EMBL/Gen Bank/DDBJ No. AF284646 and AF284647 
respectively. The inventors had discovered that the proteins, in either non-fusion or fusion 
forms bearing the myc epitope and six residues of His at their carboxyl terminal end could 
be successfully produced in E. coli by means of recombinant DNA technologies. The NP 
and P proteins were expressed to a substantial level in the bacteria and can be recognised 

25 by chicken anti-NDV serum. 

Summary of invention 

The present invention provides nucleotides encoding the full length NP and P 
polypeptides of Newcastle disease virus strain AF2240. Whereas the genome of NDV is 
of length approximately 15,000 nucleotides, it has been determined, by this invention, that 
30 the portion coding for the NP polypeptide is approximately 1470 nucleotides long and the 
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portion that codes for the P polypeptide is approximately 1188 nucleotides long. 
Accordingly, one aspect of the present invention provides for the coding regions of the 
nucleocapsid (NP) and phosphoprotein (P) genes of Newcastle disease virus strain 
AF2240. Both the nucleotide sequences are as listed below: 

5 NP coding region 

10 20 30 40 50 60 

ATGTCTTCCG TATTCGATGA ATACGAGCAG CTCCTCGCTG CTCAGACTCG CCCCAATGGA 

70 80 90 100 110 120 

10 GCTCACGGAG GGGGAGAGAG AGGGAGCACT TTAAGAGTTG AGGTCCCAGT ATTCACTCTT 

' 130 140 150 160 170 180 

AACAGTGACG ATCCAGAAGA TAGATGGAAT TTTGCGGTAT TCTGTCTTCG GATTGCTGTT 

190 200 210 220 230 240 

AGCGAGGACG CCAACAAACC GCTCAGGCAA GGTGCTCTCA TATCCCTCCT GTGCTCCCAT 

15 250 260 270 280 290 300 

TCTCAAGTGA TGAGGAACCA TGTTGCCCTT GCAGGAAAAC AGAATGAGGC TACACTGACT 

310 320 330 340 350 360 

GTTCTTGAGA TCGATGGTTT TACCAGCAGC GTGCCTCAGT TCAACAACAG GAGTGGGGTG 

370 380 390 400 410 420 

20 TCTGAGGAGA GAGCACAGAG ATTCATGGTG ATAGCAGGGT CTCTCCCTCG GGCGTGCAGT 

430 440 450 460 470 480 

AACGGTACTC CGTTCGTCAC GGCTGGGGTT GAAGATGATG CACCAGAAGA TATCACTGAT 

490 500 510 520 530 540 

ACTCTGGAAA GAATCCTGTC TATCCAGGCT CAGGTATGGG TCACAGTAGC GAAGGCCATG 

25 550 560 570 580 590 600 

ACTGCATATG AGACAGCAGA TGAGTCGGAA ACAAGAAGAA TCAATAAGTA CATGCAGCAA 

610 620 630 640 650 660 

GGCAGAGTCC AGAAGAAGTA CATCCTCCAC CCTGTATGCA GGAGTGCAAT TCAACTCACA 



670 680 690 

ATCAGACATT CTCTGGCAGT CCGCATTTTC 

730 740 750 

ACGGCAGGTG GGAGCTCCAC GTATTACAAC 

790 800 810 

AACACCGGAC TTACTGCATT CTTCCTTACA 

850 860 870 

GCCCTAGCAC TCAGCAGCCT CACAGGCGAT 

910 920 930 

TATCGGATGA AGGGAGAAAA TGCGCCGTAC 

970 980 990 

AGCTTTGCAC CGGCTGAGTA TGCACAGCTT 

1030 1040 1050 

TTAGATAAAG GAACTGGCAA ATACCAATTC 

1090 1100 1110 

AGACTCGGGG TGGAGTATGC TCAGGCTCAG 

1150 1160 1170 

GAGCTAAAAC TAACCCCGGC AGCAAGAAGG 

1210 1220 1230 

GAGGAAACTG GCAGCGTGGA TATTCCTACT 

1270 1280 1290 

GATGGAGGCC CCCGAGCCTC TCAGGGTGGA 

1330 1340 1350 

GGAGATGGGG AGACCCAATT CTTGGATTTG 



•700 710 720 

TTAGTTAGCG AGCTTAAGAG AGGCCGCAAT 

760 770 730 

TTAGTAGGGG ATGTAGACTC ATACATCAGG 

820 830 840 

CTCAAATATG GAATTAATAC CAAGACATCA 

880 890 900 

ATCCAAAAGA TGAAGCAGCT CATGCGTTTA 

940 950 960 

ATGACATTGC TAGGTGACAG TGATCAGATG 

1000 1010 1020 

TATTCTTTTG CCATGGGCAT GGCATCAGTC 

1060 1070 1080 

GCCAGAGACT TCATGAGCAC ATCATTCTGG 

1120 1130 1140 

GGGAGTAGCA TCAACGAAGA CATGGCTGCT 

1180 1190 1200 

GGCCTGGCAG CTGCTGCCCA ACGAGTGTCT 

1240 1250 1260 

CAACAAGCCG GGGTCCTCAC TGGGCTCAGC 

1300 1310 1320 

TCGAACAAGT CGCAAGGGCA ACCAGATGCC 

1360 1370 1380 

ATGAGAGCAG TGGCGAACAG CATGCGAGAA 



1390 1400 1410 1420 1430 1440 

GCGCCAAACT CCGCACAGAG CACCACCCAC CCGGAACCCC CCCCGACTCC CGGGCCATCA 
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1450 1460 1470 1480 1490 1500 

CAAGATAACG ACACCGACTG GGGGTATTGA , 

P gene coding region 

10 20 30 40 50 60 

5 ATGGCCACCT TTACAGATGC GGAGATAGAT GATATATTTG AGACCAGTGG AACTGTCATT 

70 80 90 100 110 120 

GACAGCATAA TTACGGCCCA GGGTAAATCA GCAGAGACTG TCGGAAGGAG CGCAATCCCA 

130 140 150 160 170 180 

CAAGGCAAGA CCAAAGCGCT GAGCATAGCA TGGGAGAAGC ATGGGAGCAT CCAACCATCC 

10 190 200 210 220 230 240 

ACCAGCCAGG ACAACCCCGA CCAACAGGAT AGACCAGACA AACAGCTATC CACACCTGAG 

250 260 270 280 290 300 

CAGGCGACCC CACACAACAG CTCGCCAGCC ACATCCGCCG AACCGCTCCC CACTCAGGCC 

310 320 330 340 350 360 

15 GCAGGTGAGG CCGGCGACAC ACAGCTCAAG ACCGGAGCAA GCAACTCTCT TCTGTCTATG 

370 380 390 400 410 420 

CTCGACAAGC TGAGCAATAA ACCATCTAAT GCTAAAAAGG GCCCATGGTC GAGTCCCCAG 

430 440 450 460 470 480 

GAAGGATATC ATCAACCTCC GACCCAACAA CATGGGGATC AGCCGAACCG CGGAAACAGC 

20 490 500 510 520 530 540 

CAGGAGAGGC TGCGGCACCA AGCCAAGGCC GCCCCTGGAA GCCGGGGCAC AGACGCGAGC 

550 560 570 580 590 600 
ACAGCATATC ATGGACAATG GAAGGAGTCA CAACTATCAG CTGGTGCAAC CCCTCATGTG 

610 620 630 640 650 660 
25 CTCCAATCAG GGCAGAGCCA AGACAGTACT CCTGTACCTG TGGATCATGT CCAGCCACCT 



670 680 
GTCGACTTTG TGCAGGCGAT 



690 700 710 

GATGACTATG ATGGAGGCGT TATCACAGAA 



720 

GGTAAGTAAA 
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730 740 750 760 770 780 

GTCGACTATC AGCTAGACCT AGTCTTAAAG CAGACATCCT CCATCCCTAT GATGCGGTCT 

790 800 810 820 830 840 

GAAATCCAAC AGCTAAAAAC ATCTGTTGCG GTCATGGAAG CTAATTTAGG CATGATGAAA 

5 850 860 870 880 890 900 

ATTCTGGACC CTGGTTGTGC TAACATTTCA TCCTTAAGTG ATCTGCGGGC AGTCGCCCGG 

910 920 930 940 950 960 

TCCCACCCAG TTTTAATTTC AGGCCCCGGA GATCCGTCCC CCTACGTGAC ACAAGGGGGT 

970 980 990 1000 1010 1020 

10 GAGATGACAC TCAATAAACT CTCACAACCA GTACAACACC CTTCCGAGTT AATTAAATCT 

1030 1040 1050 1060 1070 1080 

GCCACAGCGG GCGGACCTGA TATGGGAGTG GAAAAGGACA CTGTCCGTGC ATTGATCACC 

1090 1100 1110 1120 1130 1140 

TCGCGCCCGA TGCATCCAAG CTCCTCAGCT AAGCTCCTGA GTAAGCTGGA TGCAGCCGGG 

15 1150 1160 1170 1180 1190 1200 
TCGATTGAAG AGATCAGAAA GATCAAGCGC CTTGCACTAA ATGGCTAA 



Further, the present invention provides the amino acid sequences of both the NP and P 
proteins as listed below: 

NP gene : amino acid sequence 



20 1MSSVFDEYEQLLAAQT 16 

ATG TCT TCC GTA TTC GAT GAA TAC GAG CAG CTC CTC GCT GCT CAG ACT 
1 10 20 30 40 



25 



17 



RPNGAHGGGERGSTLR 
CGC CCC AAT GGA GCT CAC GGA GGG GGA GAG AGA GGG AGC ACT TTA AGA 
50 60 70 80 90 



32 
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33VEVPVFTLNS DDPEDR 48 
GTT GAG GTC CCA GTA TTC ACT CTT AAC AGT GAC GAT CCA GAA GAT AGA 
100 110 120 130 140 

49WNFAVFCLRIAVSEDA 64 
5 TGG AAT TTT GCG GTA TTC TGT CTT CGG ATT GCT GTT AGC GAG GAC GCC 

150 160 170 180 190 

65NKPLRQGALI SLLCSH 80 
AAC AAA CCG CTC AGG CAA GGT GCT CTC ATA TCC CTC CTG TGC TCC CAT 
200 210 220 230 240 

10 81 SQVMRNHVALAGKQNE 96 

TCT CAA GTG ATG AGG AAC CAT GTT GCC CTT GCA GGA AAA CAG AAT GAG 
250 260 270 280 

97ATLTVLEIDGFTSSVP 112 
GCT ACA CTG ACT GTT CTT GAG ATC GAT GGT TTT ACC AGC AGC GTG CCT 
15 290 300 310 320 330 

113 QFNNRSGVSEERAQRF 128 
CAG TTC AAC AAC AGG AGT GGG GTG TCT GAG GAG AGA GCA CAG AGA TTC 
340 350 360 370 380 

129 MVIAGSLPRACSNGTP 144 
20 ATG GTG ATA GCA GGG TCT CTC CCT CGG GCG TGC AGT AAC GGT ACT CCG 

390 400 410 420 430 

145 FVTAGVEDDAPEDITD 160 
TTC GTC ACG GCT GGG GTT GAA GAT GAT GCA CCA GAA GAT ATC ACT GAT 
440 450 460 470 480 

25 161 TLERI LSIQAQVWVTV 176 

ACT CTG GAA AGA ATC CTG TCT ATC CAG GCT CAG GTA TGG GTC ACA GTA 
490 500 510 520 

177 AKAMTAYETADESETR 192 
GCG AAG GCC ATG ACT GCA TAT GAG ACA GCA GAT GAG TCG GAA ACA AGA 
30 530 540 550 560 570 

193 RINKYMQQGRVQKKYI 208 
AGA ATC AAT AAG TAC ATG CAG CAA GGC AGA GTC CAG AAG AAG TAC ATC 
580 590 600 610 620 
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209 LHPVCRSAIQLT IRHS 224 
CTC CAC CCT GTA TGC AGG AGT GCA ATT CAA CTC ACA ATC AG A CAT TCT 
630 640 . 650 660 670 

225 L A V R I F L V S ' E L ' K R G R N 240 
CTG GCA GTC CGC ATT TTC TTA GTT AGC GAG CTT AAG AGA GGC CGC AAT 
680 690 700 710 720 

241 TAGGSSTYYNLVGDVD 256 
ACG GCA GGT GGG AGC TCC ACG TAT TAC AAC TTA GTA GGG GAT GTA GAC 
730 740 750 760 

257 SYIRNTGLTAFFLTLK 272 
TCA TAC ATC AGG AAC ACC GGA CTT ACT GCA TTC TTC CTT ACA CTC AAA 
770 780 790 800 810 

273 YGINTKTSALALSSLT 288 
TAT GGA ATT AAT ACC AAG ACA TCA GCC CTA GCA CTC AGC AGC CTC ACA 
820 830 840 850 860 

289 GDIQKMKQLMRLYRMK 304 
GGC GAT ATC CAA AAG ATG AAG CAG CTC ATG CGT TTA TAT CGG ATG AAG 
870 880 890 900 910 

305 GENABYMTLLGDS DQM 320 
GGA GAA AAT GCG CCG TAC ATG ACA TTG CTA GGT GAC AGT GAT CAG ATG 
920 930 940 950 960 

321 S FAPAEYAQLYS FAMG 336 
AGC TTT GCA CCG GCT GAG TAT GCA CAG CTT TAT TCT TTT GCC ATG GGC 
970 980 990 1000 

25 337 MASVLDKGTGKYQ FAR 352 

ATG GCA TCA GTC TTA GAT AAA GGA ACT GGC AAA TAC CAA TTC GCC AGA 
1010 1020 1030 1040 1050 

353 DFMSTSFWRLGVEYAQ 368 
GAC TTC ATG AGC ACA TCA TTC TGG AGA CTC GGG GTG GAG TAT GCT CAG 
30 1060 1070 1080 1090 1100 



20 



369 



AQGSSINEDMAAELKL 
GCT CAG GGG AGT AGC ATC AAC GAA GAC ATG GCT GCT GAG CTA AAA CTA 
illO 1120 1130 1140 1150 



384 
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385 T PAARRGLAAAAQRVS 
ACC CCG GCA GCA AGA AGG GGG CTG GCA GCT GCT GCC CAA CGA GTG TCT 
1160' 1170 1180 1190 1200 

401 E ETG SVDI PT Q 'QAGVL 
GAG GAA ACT GGC AGC GTG GAT ATT CCT ACT CAA CAA GCC GGG GTC CTC 
1210 1220 1230 1240 

417 TGLS DGGPRASQGGSN 
ACT GGG CTC AGC GAT GGA GGC CCC CGA GCC TCT CAG GGT GGA TCG AAC 
1250 1260 1270 1280 1290 

433 KSQGQPDAGDGETQFL 
AAG TCG CAA GGG CAA CCA GAT GCC GGA GAT GGG GAG ACC CAA TTC TTG 
1300 1310 1320 1330 1340 

449 DLMRAVANSMREAPNS 
GAT TTG ATG AGA GCA GTG GCG AAC AGC ATG CGA GAA GCG CCA AAC TCC 
1350 1360 1370 1380 1390 

465 AQSTTHPEPPPTPGPS 
GCA CAG AGC ACC ACC CAC CCG GAA CCC CCC CCG ACT CCC GGG CCA TCC 
1400 1410 1420 1430 1440 

481 QDNDTDWGY + 
CAA GAT AAC GAC ACC GAC TGG GGG TAT TGA 
1450 1460 1470 



P gene : amino acid sequence 



1MAT FTDAEIDDI FETS 
ATG GCC ACC TTT ACA GAT GCG GAG ATA GAT GAT ATA TTT GAG ACC AGT 
1 10 20 30 40 

17GTVIDSIITAQGKSAE 
GGA ACT GTC ATT GAC AGC ATA ATT ACG GCC CAG GGT AAA TCA GCA GAG 
50 60 70 80 90 



33 



TVGRSA .I PQGKTKALS 
ACT GTC GGA AGG AGC GCA ATC CCA CAA GGC AAG ACC AAA GCG CTG AGC 
100 110 120 130 140 
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49IAWEKHGSIQPSTSQD 64 
ATA GCA TGG GAG AAG CAT GGG AGC ATC CAA CCA TCC ACC AGC CAG GAC 
150 160 170 180 .190 

65NPDQQD RPDKQLSTPE 80 
5 AAC CCC GAC CAA CAG GAT AGA CCA GAC AAA CAG CTA TCC ACA CCT GAG 

200 210 220 230 240 

81QATPHNSSPATSAEPL 96 
CAG GCG ACC CCA CAC AAC AGC TCG CCA GCC ACA TCC GCC GAA CCG CTC 
250 260 270 280 

10 97 PTQAAGEAGDTQLKTG 112 

CCC ACT CAG GCC GCA GGT GAG GCC GGC GAC ACA CAG CTC AAG ACC GGA 
290 300 310 320 330 

113 ASNSLLSMLDKLSNKP 128 
GCA AGC AAC TCT CTT CTG TCT ATG CTC GAC AAG CTG AGC AAT AAA CCA 
340 350 360 370 380 



15 
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129 SNAKKGPWSSPQEGYH 144 
TCT AAT GCT AAA AAG GGC CCA TGG TCG AGT CCC CAG GAA GGA TAT CAT 
390 400 410 420 430 

145 QPPTQQHGDQPNRGNS 160 
CAA CCT CCG ACC CAA CAA CAT GGG GAT CAG CCG AAC CGC GGA AAC AGC 
440 450 460 470 480 

161 QERLRHQAKAAPGSRG 176' 
CAG GAG AGG CTG CGG CAC CAA GCC AAG GCC GCC CCT GGA AGC CGG GGC 
490 500 510 520 

25 177 TDASTAYHGQWKESQL 192 

ACA GAC GCG AGC ACA GCA TAT CAT GGA CAA TGG AAG GAG TCA CAA CTA 
530 540 550 560 570 



30 



193 SAGAT PHVLQSGQSQD 208 
TCA GCT GGT GCA ACC CCT CAT GTG CTC CAA TCA GGG CAG AGC CAA GAC 
580 590 600 610 620 

209 STPVPVDHVQPPVDFV 224 
AGT ACT CCT GTA CCT GTG GAT CAT GTC CAG CCA CCT GTC GAC TTT GTG 
630 640 650 660 670 
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225 QAMMTMMEALSQKVSK 240 
CAG GCG ATG ATG ACT ATG ATG GAG GCG TTA TCA CAG AAG GTA AGT AAA 
680 690 • 700 710 . 720 

241 VDYQLDLVLKQTSSI P 256 
5 GTC GAC TAT CAG CTA GAC CTA GTC TTA AAG CAG ACA TCC TCC ATC CCT 

730 .740 750 760 

257 MMRSEIQQLKTSVAVM 272 
ATG ATG CGG TCT GAA ATC CAA CAG CTA AAA ACA TCT GTT GCG GTC ATG 
770 780 790 800 810 

10 273 EANLGMMKILDPGCAN 288 

GAA GCT AAT TTA GGC ATG ATG AAA ATT CTG GAC CCT GGT TGT GCT AAC 
820 830 840 850 860 

289 ISSLSDLRAVARSHPV 304 
ATT TCA TCC TTA AGT GAT CTG CGG GCA GTC GCC CGG TCC CAC CCA GTT 
15 870 880 890 900 910 

305 LISGPGDPSPYVTQGG 320 
TTA ATT TCA GGC CCC GGA GAT CCG TCC CCC TAC GTG ACA CAA GGG GGT 
920 930 940 950 960 

321 EMTLNKLSQPVQHPSE 336 
20 GAG ATG ACA CTC AAT AAA CTC TCA CAA CCA GTA CAA CAC CCT TCC GAG 

970 980 990 1000 

337 LIKSATAGGPDMGVEK 352 
TTA ATT AAA TCT GCC ACA GCG GGC GGA CCT GAT ATG GGA GTG GAA AAG 
1010 1020 1030 1040 1050 

25 353 DTVRALITSRPMHPSS 368 

GAC ACT GTC CGT GCA TTG ATC ACC TCG CGC CCG ATG CAT CCA AGC TCC 
1060 1070 1080 1090 1100 

369 SAKLLSKLDAAGS IEE 384 
TCA GCT AAG CTC CTG AGT AAG CTG GAT GCA GCC GGG TCG ATT GAA GAG 
1110 11210 1130 1140 1150 

385 I RKI KRLALNG* 396 
ATC AGA AAG ATC AAG CGC CTT GCA CTA AAT GGC TAA 
1160 1170 1180 



30 
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A primary use of the nucleotides as defined above is for the creation of plasmids usin^ 
recombinant DNA technologies. The resulting recombinant molecule can then be 
introduced into an appropriate host. The plasmids thus created can be used to encode NP 
and P proteins. For expression of the NP and P proteins, any of the common expression 

5 vectors, especially the bacterial vectors can be used. The usable bacterial hosts for the 

vectors include any of the conventional prokaryotic cells. In this invention, the bacterial 
host used was Escherichia coli. Accordingly, a further aspect of the present invention 
provides for a prokaryotic cell, such as for example a bacterial cell and in particular an E. 
coli cell containing the nucleotides as defined above for the production of NP and P 

10 proteins. 

The NP and P proteins, produced using recombinant plasmids in accordance with the 
present invention, can be in the fusion or non-fusion forms. In accordance with the 
embodiment of the present invention, it provides a method for producing the fusion and 
non-fusion forms of both the NP and P proteins of NDV virus strain AF2240 in an E. coli 
15 system. The preferred method for producing the fusion and non-fusion forms of both the 
NP and P proteins of NDV virus strain AF2240 comprises culturing the transformed E. 
coli of the present invention on an appropriate medium to express the said nucleocapsid 
protein and phosphoprotein, and isolating and purifying the expressed fusion proteins 
from the cultures. 

20 While the invention will now be described in connection with certain preferred 
embodiments in the following experiments so that aspects thereof may be more fully 
understood and appreciated, it is not intended to limit the invention to these particular 
embodiments. On the contrary, it is intended to cover all alternatives, modifications and 
equivalents as may be included within the scope of the invention as defined by the 

25 appended claims. 

Brief description of the figures 

Figure 1 is a western blot of NDV nucleocapsid protein (NP) expressed by transformed E. 
coli TOP 10 containing plasmid pTrcHis2-NP 

Figure 2 is a western blot of NDV phosphoprotein (P) expressed by transformed E. coli 
30 TOP 1 0 containing plasmid pTrcHis2-P 
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Detailed description of the invention 

The present invention was accomplished through the employment of the recombinant 
DNA techniques which comprises the amplification of the NP and P coding regions of 
NDV strain AF2240. the cloning of the genes into the expression vector, the production of 
5 the transformed E. coli, the cultivation of the transformant, the expression of the NP and P 

proteins and the purification of the expressed fusion proteins. 

The NP and P coding regions of NDV strain AF2240 which had been cloned into the 
expression vector were prepared through reverse transcription-polymerase chain reaction 
(RT-PCR). Three primers were used for each gene, which consisted of one forward and 
1 0 two reverse primers as listed below: 

For the amplification of the NP gene 

NPfl (20 mer)\ 5'- cct tct gcc aac atg tct tc -3' (Forward primer) 
NPrl (20 mer): 5'- tea ata ccc cca gtc ggt gt -3' (Reverse primer) 
NPr2 (18 mer): 5'- ata ccc cca gtc ggt gtc -3 5 (Reverse primer) 

15 For the amplification of the P gene 

Pfl (20 mer): 5'- atg gcc acc ttt aca gat gc -3' (Forward primer) 
Prl (23 mer): 5'- taa tta gcc att tag tgc aag gc -3' (Reverse primer) 
Pr2 (21 mer): 5'- gcc att tag tgc aag gcg ctt -3' (Reverse primer) 

20 Incorporation of primers designated as NPfl and NPrl (for the NP gene), or Pfl and Prl 
(for the P gene) during PCR had amplified gene products containing a stop codon at their 
3' ends, while the presence of primers NPfl and NPr2 (for the NP gene) or Pfl and Pr2 
(for the P gene) gave rise to genes without any no stop codon. For cloning and expression 
purposes, a commercially available expression vector, pTrcHis2 (Invitrogen, USA) 

25 containing the coding regions for the myc epitope and 6 His residues downstream of the 
multiple cloning site was used. After cloning of the respective coding regions of NP and P 
genes into the pTrcHis2 vector, they were subsequently introduced into a bacterial host E. 
coli TOP 10. The resulting plasmid harbouring the NP gene was designated as pTrcHis2- 
NP while the other one with the P gene as an insert was denoted as pTrcHis2-P. Both the 
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NP and P proteins were expressed in E.coli TOP 10 cells as. non-fusion and fusion 
proteins. The latter forms contain the myc epitope and 6 His residues at their C termini. 
For protein identification, protein samples were analysed with SDS- PAGE and then 
followed by immunoblotting with the anti-NDV chicken serum and the anti-myc 
5 monoclonal antibody. The western blots for NP and P proteins are as shown in Figure 1 

and Figure 2, respectively. 

The expressed NP fusion protein was purified with affinity chromatography (nickel 
column), and was judged to be more than 90% pure by SDS-PAGE. 

The nucleotide sequences of the NP and P genes were determined by the ABI PRISM 
10 automated sequencer, model 377. The recombinant plasmids, pTrcHis2-NP and pTrcHis2- 
P, were used as templates and the synthetic primers used in the sequencing reactions of 
the NP and P genes are as follows: 

For the sequencing of the NP gene coding region 

pTrcHis2F (21 mer): 5'- gag gta tat att aat gta teg -3' 
15 sNPfl (21 mer): 5'- gac tea tac ate agg aac acc -3' 

sNPf2 (21 mer): 5'- gat gag age agt ggc gaa cag -3' 

pTrcHis2R ( 1 8 mer) : 5 5 - gat tta ate tgt ate agg -3 ' 

sNPrl (20 mer): 5'- tea ata ccc cca gtc ggt gt -3' 

sNPr2 ( 21 mer): 5 5 - eta agt tgt aat acg tgg age -3' 
20 sNPr3 (21 mer): 5'- cca teg ate tea aga aca tgc -3' 

For the sequencing of the P gene coding region 

pTrcHis2F (21 mer): 5'- gag gta tat att aat gta teg -3' 

sPfl (21 mer): 5'- gtc gac ttt gtg cag gcg atg -3' 

sPf2 (21 mer): 5'- gga cac tgt ccg tgc att gat -3' 

25 pTrcHis2.R (18 mer): 5'- gat tta ate tgt ate agg -3' 

sPrl (21 mer): 5'- cca ggg tec aga att ttc ate -3' 

sPr2 (22 mer): 5'- ggt gtg gat age tgt ttg tct g -3' 
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Both the NP and P coding regions were sequenced from 5 5 to 3' direction and reversely 
from 3' to 5 5 direction. 

Example I illustrates the recombinant DNA techniques employed in obtaining bacterial 
clones harbouring a plasmid containing inserts of NP and P coding cDNA for NDV 
5 genomic RNA, the nucleotide sequences of the NP and P genes, and also the expressed 

NP and P proteins. 

EXAMPLE I 

Virus Propagation 

The stock of NDV strain AF2240 was originally obtained from the Veterinary Research 
10 Institute (VRI), Ipoh. The virus was grown in the allantoic cavity of 8 to 9 day-old 
chicken embryonated eggs according to the procedures of Blaskovic and Styk (1967). 
After 3 - 4 days of incubation at 37°C, the eggs were chilled overnight at 4°C. The 
allantoic fluid was then harvested and the presence of the viruses was determined by 
haemagglutination (HA) test. The allantoic fluid which showed positive reaction of HA 
15 test was then clarified by centriftigation at 6000 xg for 20 min at 4°C (Beckman, JA14 
rotor, USA) to remove debris. 

Genomic RNA extraction 

Total RNA was extracted using the Trizol LS reagent (Gibco BRL, USA). Briefly, 250 jal 
of the virus infected allantoic fluid was mixed with 750 jal Trizol LS reagent and 

20 incubated for 5 min at room temperature. After incubation, 100 jal of l-bromo-3- 
chloropropane (BCP) (MRC, UK) was added and the mixtures were mixed vigorously for 
about 15 s and again incubated at room temperature for 10 min. The mixtures were phase 
separated by microcentrifugating at 13,000 xg for 15 min at 4°C (Jouan MR 1812, 
France). The RNA was then precipitated by adding 500 |al of isopropanol (Merck) to the 

25 aqueous phase and left at room temperature for 10 min. The precipitated RNA was 
microcentrifuged at 13,000 xg for 10 min and the pellet obtained was washed once with 
75% (v/v) diethyl pyrocarbonate (DEPC) (Sigma, USA) treated ethanol (Hamburg). The 
pellet was dissolved in 20 |al of DEPC treated dH 2 0. 
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cDNA synthesis and amplification of nucleocapsid (NP) and phosprotein (P) genes by 
RT-PCR 

The amplification reactions were carried out in a programmed thermal cycler (MJ 
Research Inc. USA). Synthesis of the first strand cDNA was performed in a final volume 
of 30 |al. The reaction mixture contained 0.4 |iM of each the forward and reverse primers, 
5 0.2 mM deoxynucleoside triphosphate (MBI Fermentas, Inc. USA), 5 U of AMV reverse 

transcriptase (Promega, USA), 8 U of RNase inhibitor (Gibco BRL, USA), 1.5 mM of 
MgCl 2 and lx of reaction buffer (50 mM Tris-HCl, 15 mM (NH 4 ) 2 S04, 0.1% Triton X- 
100). The mixture was incubated at 42°C for 30 min to synthesise the first strand of 
cDNA, and then 94°C for 3 min to inactivate the reverse transcriptase. 

10 For the amplification of the respective NP and P genes, another 20 jal of reaction mixture 
containing 1 U of DyNAzyme EXT DNA polymerase (FINNZYMES), 1.5 mM of MgCl 2 
and 1 x of reaction buffer was added to the top of the above cDNA mixture which was 
held at 94°C in the thermal cycler. The PCR profile for the amplification of NP gene 
comprising denaturation at 94°C for 30 s, annealing at 55°C for 50 s and extension at 

15 72°C for 1 min for a total of 30 cycles. To ensure a complete synthesis of the PCR 
product, the extension step at 72°C was prolonged for 7 min after the last cycle. The PCR 
profile for the amplification of P gene was basically similar to that of NP gene, except the 
annealing step was carried out at 55°C for 30 s. 

Purification of the amplified PCR products 

20 A total of 40 ^1 of the amplified PCR product was analysed on 1% TAE agarose gel. 
After the staining with ethidium bromide, the band with the correct size was excised from 
the gel and purified with the Wizard PCR Preps DNA Purification System (Promega, 
USA) according to the manufacturer's procedures. After purification, 5 |al of the PCR 
product was again analysed with agarose gel electrophoresis to determine the recovery of 

25 the PCR product, which would be used in TA cloning. 
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TOPO TA Cloning of NP and P genes 

Four |il of the purified NP or P DNA fragments carrying an A overhang at their 3' ends 
was mixed with 1 ja! of the pTrcHis2 TOPO expression vector (Invitogen, USA) and the 
ligation reaction was carried out at room temperature (25°C) for 5 min to form the desired 
5 recombinant plasmid. 

Transformation 

For transformation, 5 jal of the ligation mixture was added to 50 jal of TOP 10 E. coli 
competent cells (Invitrogen, USA). The transformation mixture was incubated on ice for 
30 min and the cells were heated at 42°C for 30 to 60 s. This was followed by the adding 
of 250 \x\ SOC medium (2% trypton,0.5% yeast extract, 10 mM NaCl, 2.5 mM KC1, 10 
10 mM MgCl 2 , 10 mM MgS0 4 , 20 mM glucose) and the incubation of the reaction mixture 
at 37°C for 30 to 60 min with shaking at 250 rpm. Thirty-50 jal of the transformation 
mixture was spread on a LB plate containing 50 |ag/ml ampicillin and 0.5% of glucose, 
and the plates were then incubated overnight at 37°C. 

Screening for positive clones 

j 5 Ten single colonies were randomly chosen and cultured overnight in 3 to 5 ml of LB 
medium containing 50 |ag/ml ampicillin and 0.5% glucose. Plasmid DNA was isolated by 
using the alkaline lysis method and the orientation of the insert in the positive clones was 
confirmed by PCR. 

Protein expression 

20 The identified positive clones were cultured overnight in LB medium containing 50 jag/ml 
ampicillin. The next day, 10 ml of LB medium containing 50 |ag/ml ampicillin was 
inoculated with 0.2 ml of the overnight culture and incubated at 37°C with shaking at 250 
rpm. Once the cells reached the optical density of 0.6 to 0.8 atA 6 oo, 1 mM IPTG was 




added into the culture and continued shaking for 3 to 5 hours. The cells were harvested 
from the culture by centrifugation and then subjected to polyacrylamide gel 
electrophoresis (SDS-PAGE). 

SDS-PAGE and western blotting 

5 The cell pellets (from 1 ml culture solution) were resuspended in 50 to 100 jil of IX SDS- 

PAGE sample buffer and boiled for 10 min. Five to 10 jal of the sample was loaded onto 
12% SDS-PAGE gel and eletrophoresesed for 70 to 80 min at 32 volt. The proteins on 
SDS-PAGE gel were then electro-transferred onto a nitrocellulose membrane for 1 h. 
Western blotting was carried out by blocking the membrane first with skim milk for 1 h to 

10 saturate unoccupied regions on the membrane. This was followed by adding the anti- 
NDV chicken serum or anti-myc monoclonal antibody (for fusion protein) onto the 
membrane and this was shaken for 1 h at room temperature. The membrane was then 
washed four times with TTBS washing solution ( TBS containing 0.5% Tween 20), 5 to 
10 min for each wash to remove the unbound antibodies. After washing, peroxidase - 

15 labelled antibody was added to react with the primary antibody and left shaking for 
another 1 h. The membrane was further washed four times with TTBS solution, each for 5 
to 10 min, and lastly BCIP/NBT solution was added as substrate for the peroxidase. The 
molecular weight of NP and P proteins was about 55 kDa while the fusion form for both 
the NP and P proteins gave rise to an apparent molecular weight of about 60 kDa. 

20 Purification of NP fusion protein using ProBond Column 

Two hundred \i\ of LB medium containing 50 |ig/ml ampicillin was cultured with 2 ml of 
overnight culture of transformant harbouring plasmid pTrcHis2-NP (carrying the NP 
insert without a stop codon), and the cells were grown to an OD600 of 0.6 to 0.8. Protein 
expression was then induced by adding 1 mM IPTG and the cells were grown for another 

25 5 h. The cells were harvested by centrifugation at 2000 xg for 15 min at 4°C. The cell 
pellet was first resuspended in 10 ml of binding buffer (500 mM NaCl, 20 mM NaH 2 P0 4 , 
pH 7.8), then 100 ng/ml of lysozyme was added and incubated for 15 min on ice. The 
cells were lysed by sonication until the cell lysate is no longer viscous. The cell lysate was 
then treated with RNase and DNase I, both at a concentration of 5 |a.g/ml for 1 5 min at 

30 30°C. The cell lysate was then centrifuged at 10,000 xg for 20 min to remove all the cell 
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debris. The supernatant was collected and passed through a 0.45 jam filter. This cell iysate 
was incubated with the ProBond resin (Invirogen, USA) for 30 min and then allowed to 
drip through the resin. The column was washed with 10 ml of washing buffer (50 mM 
Imidazole, 500 mM NaCl, 20 mM NaHiPO^ pH 6.0), and the proteins were then eluted 
5 with 5 ml of elution buffer (500 mM Imidazole, 500 mM NaCl, 20 mM NaH 2 P0 4 , pH 

6.0). The elute was collected as 1 ml fractions. Samples from each fractions were 
analysed on 12% SDS-PAGE to check the purity of the protein. 
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1. Nucleotides encoding the full length or part of the nucleocapsid (NP) protein of 
Newcastle disease virus (NDV). 



2. The nucleotides as claimed in claim 1 characterised in that it has the following 
nucleotide sequence: 



10 20 30 40 50 60 

ATGTCTTCCG TATTCGATGA ATACGAGCAG CTCCTCGCTG CTCAGACTCG CCCCAATGGA 

70 80 90 100 110 120 

GCTCACGGAG GGGGAGAGAG AGGGAGCACT TTAAGAGTTG AGGTCCCAGT ATTCACTCTT 

130 140 150 160 170 180 

AACAGTGACG ATCCAGAAGA TAGATGGAAT TTTGCGGTAT TCTGTCTTCG GATTGCTGTT 

190 200 210 220 230 240 

AGCGAGGACG CCAACAAACC GCTCAGGCAA GGTGCTCTCA TATCCCTCCT GTGCTCCCAT 

250 260 270 280 290 300 

TCTCAAGTGA TGAGGAACCA TGTTGCCCTT GCAGGAAAAC AGAATGAGGC TACACTGACT 

310 320 330 340 350 360 

GTTCTTGAGA TCGATGGTTT TACCAGCAGC GTGCCTCAGT TCAACAACAG GAGTGGGGTG 

370 380 390 400 410 420 

TCTGAGGAGA GAGCACAGAG ATT'CATGGTG ATAGCAGGGT CTCTCCCTCG GGCGTGCAGT 

430 440 450 460 470 480 

AACGGTACTC CGTTCGTCAC GGCTGGGGTT GAAGATGATG CACCAGAAGA TATCACTGAT 

490 500 510 520 530 540 

ACTCTGGAAA GAATCCTGTC TATCCAGGCT CAGGTATGGG TCACAGTAGC GAAGGCCATG 

550 560 570 580 590 600 

ACTGCATATG AGACAGCAGA TGAGTCGGAA ACAAGAAGAA TCAATAAGTA CAT G C AG C AA 

610 620 630 640 650 660 

GGCAGAGTCC AGAAGAAGTA CATCCTCCAC CCTGTATGCA GGAGTGCAAT TCAACTCACA 

670 680 690 700 710 720 

AT C AG AC AT T CTCTGGCAGT CCGCATTTTC TTAGTTAGCG AGCTTAAGAG AGGCCGCAAT 

730 740 750 760 770 780 

ACGGCAGGTG GGAGCTCCAC GTATTACAAC TTAGTAGGGG ATGTAGACTC ATACATCAGG 



790 800 810 820 830 840 

AACACCGGAC TTACTGCATT CTTCCTTACA CTCAAATATG GAATTAATAC CAAGACATCA 



22 



10 



15 



850 860 870 880 890 * 900 

GCCCTAGCAC TCAGCAGCCT CACAGGCGAT ATCCAAAAGA TGAAGCAGCT CATGCGTTTA 

910 920 930 940 950 960 
TATCGGATGA AGGGAGAAAA TGCGCCGTAC ATGACATTGC TAGGTGACAG TGATCAGATG 

970 980 990 1000 1010 1020 

AGCTTTGCAC CGGCTGAGTA TGCACAGCTT TATTCTTTTG CCATGGGCAT GGCATCAGTC 

1030 1040 1050 1060 1070 1080 

TTAGATAAAG GAACTGGCAA ATACCAATTC GCCAGAGACT TCATGAGCAC ATCATTCTGG 

1090 1100 1110 1120 1130 1140 

AGACTCGGGG TGGAGTATGC TCAGGCTCAG GGGAGTAGCA TCAACGAAGA CATGGCTGCT 

1150 1160 1170 1180 1190 1200 

GAGCTAAAAC TAACCCCGGC AGCAAGAAGG GGCCTGGCAG CTGCTGCCCA ACGAGTGTCT 

1210 1220 1230 1240 1250 1260 

GAGGAAACTG GCAGCGTGGA TATTCCTACT CAACAAGCCG GGGTCCTCAC TGGGCTCAGC 

1270 1280 1290 1300 1310 1320 

GATGGAGGCC CCCGAGCCTC TCAGGGTGGA TCGAACAAGT CGCAAGGGCA ACCAGATGCC 

1330 1340 1350 1360 1370 1380 

GGAGATGGGG AGACCCAATT CTTGGATTTG ATGAGAGCAG TGGCGAACAG CATGCGAGAA 



20 



1390 1400 1410 1420 1430 1440 

GCGCCAAACT CCGCACAGAG CACCACCCAC CCGGAACCCC CCCCGACTCC CGGGCCATCA 



1450 1460 1470 

CAAGATAACG ACACCGACTG GGGGTATTGA 



1480 



1490 



1500 



3. Nucleotides encoding the full length or part of the phosphoprotein (P) of 
Newcastle disease virus (NDV). 
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4. The nucleotides as claimed in claim 3 characterised in that it has the 
following nucleotide sequence: 



30 



10 20 30 40 50 60 

ATGGCCACCT TTACAGATGC GGAGATAGAT GATATATTTG AGACCAGTGG AACTGTCATT 

70 80 90 100 110 120 

GACAGCATAA TTACGGCCCA GGGTAAATCA GCAGAGACTG TCGGAAGGAG CGCAATCCCA 

130 140 150 160 170 180 

CAAGGCAAGA CCAAAGCGCT GAGCATAGCA TGGGAGAAGC ATGGGAGCAT CCAACCATCC 
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190 200 210 220 230 240 

ACCAGCCAGG ACAACCCCGA CCAACAGGAT AGACCAGACA AACAGCTATC CACACCTGAG 

250 260 270 280 290 300 

CAGGCGACCC CACACAACAG CTCGCCAGCC ACATCCGCCG AACCGCTCCC CACTCAGGCC 



310 320 330 340 350 360 

GCAGGTGAGG CCGGCGACAC ACAGCTCAAG ACCGGAGCAA GCAACTCTCT TCTGTCTATG 



370 330 390 400 410 420 

CTCGACAAGC TGAGCAATAA ACCATCTAAT GCTAAAAAGG GCCCATGGTC GAGTCCCCAG 

430 440 450 460 470 ' 400 

GAAGGATATC ATCAACCTCC GACCCAACAA CATGGGGATC AGCCGAACCG CGGAAACAGC 

490 500 510 520 530 540 

CAGGAGAGGC TGCGGCACCA AGCCAAGGCC GCCCCTGGAA GCCGGGGCAC AGACGCGAGC 

550 560 570 580 590 600 

ACAGCATATC ATGGACAATG GAAGGAGTCA CAACTATCAG CTGGTGCAAC CCCTCATGTG 

610 620 630 640 650 660 

CTCCAATCAG GGCAGAGCCA AGACAGTACT CCTGTACCTG TGGATCATGT CCAGCCACCT 

670 . 680 690 700 710 720 

GTCGACTTTG TGCAGGCGAT GATGACTATG ATGGAGGCGT TATCACAGAA GGTAAGTAAA 

730 740 750 760 770 780 

GTCGACTATC AGCTAGACCT AGTCTTAAAG CAGACATCCT CCATCCCTAT GATGCGGTCT 

790 800 810 820 830 840 

GAAATCCAAC AGCTAAAAAC ATCTGTTGCG GTCATGGAAG CTAATTTAGG CATGATGAAA 

.850 860 870 880 890 900 

ATTCTGGACC CTGGTTGTGC TAACATTTCA TCCTTAAGTG ATCTGCGGGC AGTCGCCCGG 

910 920 930 940 950 960 

TCCCACCCAG TTTTAATTTC AGGCCCCGGA GATCCGTCCC CCTACGTGAC ACAAGGGGGT 

970 980 990 1000 1010 1020 

GAGATGACAC TCAATAAACT CTCACAACCA GTACAACACC CTTCCGAGTT AATTAAATCT 

1030 1040 1050 1060 1070 1080 

GCCACAGCGG GCGGACCTGA TATGGGAGTG G AAAAG G AC A CTGTCCGTGC ATTGATCACC 

1090 1100 1110 1120 1130 1140 
TCGCGCCCGA TGCATCCAAG CTCCTCAGCT AAGCTCCTGA GTAAGCTGGA TGCAGCCGGG 

1150 1160 1170 1180 1190 1200 

TCGATTGAAG AG AT C AG AAA GATCAAGCGC CTTGCACTAA ATGGCTAA 



5. The NP protein coded according to claim 1 or claim 2 characterised in that 
it has the following amino acid sequence: 



1MSSV FDEYEQLLAAQT 16 
ATG TCT TCC GTA TTC GAT GAA TAG GAG CAG CTC CTC GCT GCT CAG ACT 
1 10 20 30 40 

17 RPNGAHGGGERGSTLR 32 
CGC CCC AAT GGA GCT CAC GGA GGG GGA GAG AGA GGG AGC ACT TTA AGA 
50 60 70 80 90 



¥ - 24 

33VEVPVFTLNSDDPEDR 48 
GTT GAG GTC CCA GTA TTC ACT CTT AAC AGT GAC GAT CCA GAA GAT AGA 
100 110 120 130 140 

49 W N FA V F C L R I AV S E.D-A 64 
5 TGG AAT TTT GCG GTA TTC TGT CTT CGG ATT GCT GTT AGC GAG GAC GCC 

150 160 170 ' ' ISO 190 

65NKPLRQGALI SLLCSH 80 
AAC AAA CCG CTC AGG CAA GGT GCT CTC ATA TCC CTC CTG TGC TCC CAT 
200 210 220 230 240 

10 81 SQVMRNHVALAGKQNE 96 

TCT CAA GTG ATG AGG AAC CAT GTT GCC CTT GCA GGA AAA CAG AAT GAG 
250 260 270 280 

97ATLTVLEI DGFTSSVP 112 
GCT ACA CTG ACT GTT CTT GAG ATC GAT GGT TTT ACC AGC AGC GTG CCT 
15 290 300 310 320 330 

113 QFNNRSGVSEERAQRF 128 
CAG TTC AAC AAC AGG AGT GGG GTG TCT GAG GAG AGA GCA CAG AGA TTC 
340 350 360 370 380 

129 MVIAGSLPRACSNGTP 144 
20 ATG GTG ATA GCA GGG TCT CTC CCT CGG GCG TGC AGT AAC GGT ACT CCG 

390 400 410 420 430 

145 FVTAGVEDDAPEDITD 160 
TTC GTC ACG GCT GGG GTT GAA GAT GAT GCA CCA GAA GAT ATC ACT GAT 
440 450 460 470 480 

25 161 TLERI LS IQAQVWVTV 176 

ACT CTG GAA AGA ATC CTG TCT ATC CAG GCT CAG GTA TGG GTC ACA GTA 
490 500 510 520 

177 AKAMTAYETADESETR 192 
GCG AAG GCC ATG ACT GCA TAT GAG ACA GCA GAT GAG TCG GAA ACA AGA 
30 530 540 550 560 570 

193 RINKYMQQGRVQKKYI 208 
AGA ATC AAT AAG TAC ATG CAG CAA GGC AGA GTC CAG AAG AAG TAC ATC 
580 590 600 610 620 

209 LHPVCRSAIQLTIRHS 224 
35 CTC CAC CCT GTA TGC AGG AGT GCA ATT CAA CTC ACA ATC AGA CAT TCT 

630 640 650 660 670 

225 LAVRI FLVSE LKRGRN 240 
CTG GCA GTC CGC ATT TTC TTA GTT AGC GAG CTT AAG AGA GGC CGC AAT 
680 690 700 710 720 

40 241 TAGGS STYYNLVGDVD 256 

ACG GCA GGT GGG AGC TCC ACG TAT TAC AAC TTA GTA GGG GAT GTA GAC 
730 740 750 760 

257 SYIRNTGLTAFFLTLK 272 
TCA TAC ATC AGG AAC ACC GGA CTT ACT GCA TTC TTC CTT ACA CTC AAA 
45 770 780 790 800 810 

273 YGINTKTSALALSSLT 288 
TAT GGA ATT AAT ACC AAG ACA TCA GCC CTA GCA CTC AGC AGC CTC ACA 
820 830 840 850 860 

289 GDIQKMKQLMRLYRMK 304 
50 GGC GAT ATC CAA AAG ATG AAG CAG CTC ATG CGT TTA TAT CGG ATG AAG 

870 880 890 900 910 
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305 GENAPYMTLLGDSDQM 320 
GGA GAA AAT GCG CCG TAC ATG ACA TTG CTA GGT GAC AGT GAT CAG ATG 
920. 930 940 '950 960 

'321 S FAPAEYAQLY S... F A M G . 336 

5 AGC TTT GCA CCG GCT GAG TAT GCA CAG CTT TAT TCT TTT GCC ATG GGC 

970 980 990 1000 

337 MASVLDKGTGKYQFAR 352 
ATG GCA TCA GTC TTA GAT AAA GGA ACT GGC AAA TAC CAA TTC GCC AGA 
10 1010 1020 1030 1040 1050 

353 DFMSTSFWRLGVEYAQ 368 
GAC TTC ATG AGC ACA TCA TTC TGG AGA CTC GGG GTG GAG TAT GCT CAG 
1060 1070 1080 1090 1100 

369 AQGSSINEDMAAELKL 384 
15 GCT CAG GGG AGT AGC ATC AAC GAA GAC ATG GCT GCT GAG CTA AAA CTA 

1110 1120 1130 1140 1150 

385 T PAARRGLAAAAQRVS 400 
ACC CCG GCA GCA AGA AGG GGC CTG GCA GCT GCT GCC CAA CGA GTG TCT 
1160 1170 1180 1190 1200 

20 401 EETGSVDI PTQQAGVL 416 

GAG GAA ACT GGC AGC GTG GAT ATT CCT ACT CAA CAA GCC GGG GTC CTC 
1210 1220 1230 1240 

417 TGLSDGGPRASQGGSN 432 
ACT GGG CTC AGC GAT GGA GGC CCC CGA GCC TCT CAG GGT GGA TCG AAC 
25 1250 1260 1270 1280 1290 

433 KSQGQPDAGDGETQFL 448 
AAG TCG CAA GGG CAA CCA GAT GCC GGA GAT GGG GAG ACC CAA TTC TTG 
1300 1310 1320 1330 1340 

449 DLMRAVANSMREAPNS 464 
30 GAT TTG ATG AGA GCA GTG GCG AAC AGC ATG CGA GAA GCG CCA AAC TCC 

1350 1360 1370 1380 1390 

4 65 A Q S T T H P E P P P T P G P S 480 
GCA CAG AGC ACC ACC CAC CCG GAA CCC CCC CCG ACT CCC GGG CCA TCC 
1400 1410 1420 1430 1440 

35 481 QDNDTDWGY* 490 

CAA GAT AAC GAC ACC GAC TGG GGG TAT TGA 
1450 1460 1470 



6. The P protein coded according to claim 3 or claim 4 characterised in that it 
has the following amino acid sequence: 



40 1MAT FTDAEIDDI FETS 16 

ATG GCC ACC TTT ACA GAT GCG GAG ATA GAT GAT ATA TTT GAG ACC AGT 
1 10 20 30 40 



45 



17 



GTV I DS I ITAQGKSAE 
GGA ACT GTC ATT GAC AGC ATA ATT ACG GCC CAG GGT AAA TCA GCA GAG 
50 60 70 80 90 
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33TVGRSAI PQGKTKALS 
ACT GTC GGA AGG AGC GCA ATC CCA CAA GGC AAG ACC AAA GCG CTG AGC 
100 110 120 130 140 

49 I A W E K .H.G S - I Q' P S T S Q D 
ATA GCA TGG GAG AAG CAT GGG AGC ATC CAA CCA TCC ACC AGC CAG GAC 
150 160 - 170 180 190 

65NPDQQDRPDKQLSTPE 
AAC CCC GAC CAA CAG GAT AGA CCA GAC AAA CAG CTA TCC ACA CCT GAG 
200 210 220 230 240 

81QATPHNSSPATSAEPL 
CAG GCG ACC CCA CAC AAC AGC TCG CCA GCC ACA TCC GCC GAA CCG CTC 
250 260 270 280 

97 PTQAAGEAGDTQLKTG 
CCC ACT CAG GCC GCA GGT GAG GCC GGC GAC ACA CAG CTC AAG ACC GGA 
290 300 310 320 330 

113 ASNSLLSMLDKLSNKP 
GCA AGC AAC TCT CTT CTG TCT ATG CTC GAC AAG CTG AGC AAT AAA CCA 
340 350 360 370 380 

129 SNAKKGPWSS PQEGYH 
TCT AAT GCT AAA AAG GGC CCA TGG TCG AGT CCC CAG GAA GGA TAT CAT 
390 400 410 420 430 

145 QPPTQQHGDQPNRGNS 
CAA CCT CCG ACC CAA CAA CAT GGG GAT CAG CCG AAC CGC GGA AAC AGC 
440 450 460 470 480 

161 QERLRHQAKAAPGSRG 
CAG GAG AGG CTG CGG CAC CAA GCC AAG GCC GCC CCT GGA AGC CGG GGC 
490 500 510 520 

177 TDASTAYHGQWKESQL 
ACA GAC GCG AGC ACA GCA TAT CAT GGA CAA TGG AAG GAG TCA CAA CTA 
530 540 550 560 570 

193 SAGAT PHVLQSGQSQD 
TCA GCT GGT GCA ACC CCT CAT GTG CTC CAA TCA GGG CAG AGC CAA GAC 
580 590 600 610 620 

209 STPVPVDHVQPPVDFV 
AGT ACT CCT GTA CCT GTG GAT CAT GTC CAG CCA CCT GTC GAC TTT GTG 
630 640 650 660 670 



225 QAMMTMMEALSQKVS K 
CAG GCG ATG ATG ACT ATG ATG GAG GCG TTA TCA CAG AAG GTA AGT AAA 
680 690 700 710 720 

241 VDYQLDLVLKQTSS I P 
GTC GAC TAT CAG CTA GAC CTA GTC TTA AAG CAG ACA TCC TCC ATC CCT 
730 740 750 760 

257 MMRSEIQQLKTSVAVM 
ATG ATG CGG TCT GAA ATC CAA CAG CTA AAA ACA TCT GTT GCG GTC ATG 
770 780 790 800 810 

273 EANLGMMKILDPGCAN 
GAA GCT AAT TTA GGC ATG ATG AAA ATT CTG GAC CCT GGT TGT GCT AAC 
820 830 840 850 860 



289 ISSLSDLRAVARSHPV 
ATT TCA TCC TTA AGT GAT CTG CGG GCA GTC GCC CGG TCC CAC CCA GTT 
870 880 890 900 910 
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305 LISGPGDPSPYVTQGG 320 
TTA ATT TCA GGC CCC GGA GAT . CCG TCC CCC TAC GTG ACA CAA GGG GGT 
920 930 940 950 960 

•321 EMTLNKLSQPVQHPS E 336 
5 GAG ATG ACA CTC AAT AAA CTC TCA CAA CCA GTA CAA CAC CCT TCC GAG 

970 980 990 1000 

337 L IKSATAGGPDMGVEK 352 
TTA ATT AAA TCT GCC ACA GCG GGC GGA CCT GAT ATG GGA GTG GAA AAG 
1010 1020 1030 1040 1050 

10 353 DTVRALITSRPMHPSS 368 

GAC ACT GTC CGT GCA TTG ATC ACC TCG CGC CCG ATG CAT CCA AGC TCC 
1060 1070 1080 1090 1100 

369 SAKLLSKLDAAGS IEE 384 
TCA GCT AAG CTC CTG AGT AAG CTG GAT GCA GCC GGG TCG ATT GAA GAG 
15 1110 1120 1130 1140 1150 

385 I RKIKRLALNG* 396 
ATC AGA AAG ATC AAG CGC CTT GCA CTA AAT GGC TAA 
1160 1170 1180 



7. A recombinant expression plasmid containing the NDV nucleocapsid gene as 
20 claimed in claim 1 or claim 2. 

8. A recombinant expression plasmid containing the NDV phosphoprotein gene 
as claimed in claim 3 or claim 4. 

9. The recombinant expression plasmid according to claim 7 which is the 
expression plasmid pTrcHis2-NP constructed by cloning the NDV 

25 nucleocapsid gene of claims 1 or 2 into vector pTrcHis2. 

10. The recombinant expression plasmid according to claim 8 which is the 
expression plasmid pTrcHis2-P constructed by cloning the NDV 
phosphoprotein gene of claims 3 or 4 into vector pTrcHis2. 

1 1 . A transformed Escherichia coli with the recombinant expression plasmid 
30 according to claim 7 or claim 9. 



12. A transformed Escherichia coli with the recombinant expression plasmid 
according to claim 8 or claim 10. 




13. The transformed microorganism according to claim 11, which is the 
transformed E. coli TOP 10 (pTrcHis2-NP) produced by introducing the 
recombinant expression plasmid of claim 7 or claim 9 into E. coli TOP 10. 

14. The transformed microorganism according to claim 12, which is the 
5 transformed E. coli (pTrcHis2-P) produced by introducing the recombinant 

expression plasmid of claim 8 or claim 10 into E. coli TOP 10. 



15. A fused or non-fused form of NDV nucleocapsid protein isolated and purified 
from culture of the transformed microorganism of claim 11 or claim 13 
characterised in that it has the following amino acid sequence: 



10 1MSSVFDEYEQLLAAQT 16 

ATG TCT TCC GTA TTC GAT GAA TAC GAG CAG CTC CTC GCT GCT CAG ACT 
1 10 20 30 40 
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17RPNGAHGGGERGSTLR 32 
CGC CCC AAT GGA GCT CAC GGA GGG GGA GAG AGA GGG AGC ACT TTA AGA 
50 60 70 80 90 

33 VEVPVFTLNSDDPEDR 48 
GTT GAG GTC CCA GTA TTC ACT CTT AAC AGT GAC GAT CCA GAA GAT AGA 
100 110 120 130 140 

49 WN FAVFCLRIAVSEDA 64 
20 TGG AAT TTT GCG GTA TTC TGT CTT CGG ATT GCT GTT AGC GAG GAC GCC 

150 160 170 180 190 

65NKPLRQGALISLLCSH 80 
AAC AAA CCG CTC AGG CAA GGT GCT CTC ATA TCC CTC CTG TGC TCC CAT 
200 210 220 230 240 

25 81 SQVMRN HVALAGKQNE 96 

TCT CAA GTG ATG AGG AAC CAT GTT GCC CTT GCA GGA AAA CAG AAT GAG 
250 260 270 280 

97ATLTVLEI DGFTS SVP 112 
GCT ACA CTG ACT GTT CTT GAG ATC GAT GGT TTT ACC AGC AGC GTG CCT 
30 290 300 310 320 330 

113 QFNNRSGVSEERAQRF 128 
CAG TTC AAC AAC AGG AGT GGG GTG TCT GAG GAG AGA GCA CAG AGA TTC 
340 350 360 370 380 

129 MVIAGSLPRACSNGTP 144 
35 ATG GTG ATA GCA GGG TCT CTC CCT CGG GCG TGC AGT AAC GGT ACT CCG 

390 400 410 420 430 

145 FVTAGVEDDAPED ITD 160 
TTC GTC ACG GCT GGG GTT GAA GAT GAT GCA CCA GAA GAT ATC ACT GAT 
440 450 460 470 480 

40 161 TLERILSIQAQVWVTV 176 

ACT CTG GAA AGA ATC CTG TCT ATC CAG GCT CAG GTA TGG GTC ACA GTA 
490 500 510 520 
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177 AKAMTAYETADESETR 192 
GCG AAG GCC ATG ACT GCA TAT GAG ACA GCA GAT GAG TCG GAA ACA AGA 
530 540 550 560 570 

193 . R I NKYMQQ GRVQ KKY I 208 
5 ACA ATC AAT AAG TAC ATG, C AG CAA GGC AGA GTC CAG AAG AAG TAC ATC 

5R0 590 *00 610 * 620 

209 LHPVCRSAIQLTIRHS 224 
CTC CAC CCT GTA TGC AGG AGT GCA ATT CAA CTC ACA ATC AGA CAT TCT 
630 640 650 660 670 

10 225 LAVRI FLVSELKRGRN 240 

CTG GCA GTC CGC ATT TTC TTA GTT AGC GAG CTT AAG AGA GGC CGC AAT 
680 690 700 710 720 

241 TAGGSSTYYNLVGDVD 256 
ACG GCA GGT GGG AGC TCC ACG TAT TAC AAC TTA GTA GGG GAT GTA GAC 
15 730 740 750 760 

257 SYIRNTGLTAFFLTLK 272 
TCA TAC ATC AGG AAC ACC GGA CTT ACT GCA TTC TTC CTT ACA CTC AAA 
770 780 790 800 810 

273 YGINTKTSALALSSLT 288 
20 TAT GGA ATT AAT ACC AAG ACA TCA GCC CTA GCA CTC AGC AGC CTC ACA 

820 830 840 850 860 

289 GDIQKMKQLMRLYRMK 304 
GGC GAT ATC CAA AAG ATG AAG CAG CTC ATG CGT TTA TAT CGG ATG AAG 
870 880 890 900 910 

25 305 GENAPYMTLLGDSDQM 320 

GGA GAA AAT GCG CCG TAC ATG ACA TTG CTA GGT GAC AGT GAT CAG ATG 
920 930 940 950 960 

321 S FAPAEYAQLYS FAMG 336 
AGC TTT GCA CCG GCT GAG TAT GCA CAG CTT TAT TCT TTT GCC ATG GGC 
30 970 980 990 1000 

337 MASVLDKGTGKYQFAR 352 
ATG GCA TCA GTC TTA GAT AAA GGA ACT GGC AAA TAC CAA TTC GCC AGA 
1010 1020 1030 1040 1050 

353 DFMSTSFWRLGVEYAQ 368 
35 GAC TTC ATG AGC ACA TCA TTC TGG AGA CTC GGG GTG GAG TAT GCT CAG 

1060 1070 1080 1090 1100 

369 AQGSSINEDMAAELKL 384 
GCT CAG GGG AGT AGC ATC AAC GAA GAC ATG GCT GCT GAG CTA AAA CTA 
1110 1120 1130 1140 1150 

40 385 T PAARRG LAAAAQRV S 400 

ACC CCG GCA GCA AGA AGG GGC CTG GCA GCT GCT GCC CAA CGA GTG TCT 
1160 1170 1180 1190 1200 

401 EETGSVDI PTQQAGVL 416 
GAG GAA ACT GGC AGC GTG GAT ATT CCT ACT CAA CAA GCC GGG GTC CTC 
45 1210 1220 1230 1240 

417 TGLSDGGPRASQGGSN 432 
ACT GGG CTC AGC GAT GGA GGC CCC CGA GCC TCT CAG GGT GGA TCG AAC 
12 50 12 60 127 0 128 0 12 90 

433 KSQGQPDAGDGETQFL 448 
50 AAG TCG CAA GGG CAA CCA GAT GCC GGA GAT GGG GAG ACC CAA TTC TTG 

1300 1310 1320 1330 1340 
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449 DLMRAVANSMREAPNS 464 
GAT TTG ATG AGA GCA GTG GCG AAC AGC ATG CGA GAA GCG CCA AAC TCC 
1350 1360 1370 1380 1390 

465 AQ3TTHPEPPPTP G PS 480 
GCA CAG AGC ACC ACC CAC CCG GAA CCC CCC CCG ACT CCC GGG CCA TCC 
1400 1410 1420 1430 1440 

481 QDNDTDWGY* 490 
CAA GAT AAC GAC ACC GAC TGG GGG TAT TGA 
1450 1460 1470 

16. A fused or non-fused form of NDV phosphoprotein isolated and purified from 
culture of the transformed microorganism of claim 12 or claim 14 
characterised in that it has the following amino acid sequence: 



1MATFT DAEIDDIFETS 16 
ATG GCC ACC TTT ACA GAT GCG GAG ATA GAT GAT ATA TTT GAG ACC AGT 
1 10 20 30 40 

17GTVI DS I ITAQGKSAE 32 
GGA ACT GTC ATT GAC AGC ATA ATT ACG GCC CAG GGT AAA TCA GCA GAG 
50 60 70 80 90 



33TVGRSAI PQGKTKALS 48 
ACT GTC GGA AGG AGC GCA ATC CCA CAA GGC AAG ACC AAA GCG CTG AGC 
100 110 120 130 140 

49IAWEKHGSIQPSTSQD 64 
ATA GCA TGG GAG AAG CAT GGG AGC ATC CAA CCA TCC ACC AGC CAG GAC 
150 160 170 180 190 

65 N PDQQDRPDKQLST PE 80 
AAC CCC GAC CAA CAG GAT AGA CCA GAC AAA CAG CTA TCC ACA CCT GAG 
200 210 220 230 240 

81QATPHNSSPATSAEPL 96 
CAG GCG ACC CCA CAC AAC AGC TCG CCA GCC ACA TCC GCC GAA CCG CTC 
250 260 270 280 

97 PTQAAGEAGD TQLKTG 112 
CCC ACT CAG GCC GCA GGT GAG GCC GGC GAC ACA CAG CTC AAG ACC GGA 
290 300 310 320 330 

113 ASNSLLSMLDKLSNKP 128 
GCA AGC AAC TCT CTT CTG TCT ATG CTC GAC AAG CTG AGC AAT AAA CCA 
340 350 360 370 380 

129 SNAKKGPWSS PQEGYH 144 
TCT AAT GCT AAA AAG GGC CCA TGG TCG AGT CCC CAG GAA GGA TAT CAT 
390 400 410 420 430 

145 QPPTQQHGDQPNRGNS 160 
CAA CCT CCG ACC CAA CAA CAT GGG GAT CAG CCG AAC CGC GGA AAC AGC 
440 450 460 470 480 

161 QERLRHQAKAAPGSRG 176 
CAG GAG AGG CTG CGG CAC CAA GCC AAG GCC GCC CCT GGA AGC CGG GGC 
490 500 510 520 
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177 TDASTAYHGQWKESQL 192 
ACA GAC GCG AGC ACA GCA TAT CAT GGA CAA TGG AAG GAG TCA CAA CTA 
530 540 550 560 570 

193 SAGATPHVLQSGQSQD 208 
5 TCA GCT GGT GCA ACC CCT CAT GTG CTC CAA TCA GGG CAG AGC CAA GAC 

580 590 600 610 620 

209 STPVPVDHVQPPVDFV 224 
AGT ACT CCT GTA CCT GTG GAT CAT GTC CAG CCA CCT GTC GAC TTT GTG 
630 640 650 660 670 

10 225 QAMMTMMEALSQKVSK 240 

CAG GCG ATG ATG ACT ATG ATG GAG GCG TTA TCA CAG AAG GTA AGT AAA 
680 690 700 710 720 

241 VDYQLDLVLKQTS SIP 256 
GTC GAC TAT CAG CTA GAC CTA GTC TTA AAG CAG ACA TCC TCC ATC CCT 
15 730 740 750 760 

257 MMRSE IQQLKTSVAVM 272 
ATG ATG CGG TCT GAA ATC CAA CAG CTA AAA ACA TCT GTT GCG GTC ATG 
770 780 790 800 810. 

273 EANLGMMKILDPGCAN 288 
20 GAA GCT AAT TTA GGC ATG ATG AAA ATT CTG GAC CCT GGT TGT GCT AAC 

820 830 840 850 860 

289 ISSLSDLRAVARSHPV 304 
ATT TCA TCC TTA AGT GAT CTG CGG GCA GTC GCC CGG TCC CAC CCA GTT 
870 880 890 900 910 

25 305 LISGPGDPSPYVTQGG 320 

TTA ATT TCA GGC CCC GGA GAT CCG TCC CCC TAC GTG ACA CAA GGG GGT 
920 930 940 950 960 

321 EMTLNKLSQPVQH PSE 336 
GAG ATG ACA CTC AAT AAA CTC TCA CAA CCA GTA CAA CAC CCT TCC GAG 
30 970 980 990 1000 

337 LIKSATAGGPDMGVEK 352 
TTA ATT AAA TCT GCC ACA GCG GGC GGA CCT GAT ATG GGA GTG GAA AAG 
1010 1020 1030 1040 1050 

353 DTVRALITSRPMHPSS 368 
35 GAC ACT GTC CGT GCA TTG ATC ACC TCG CGC CCG ATG CAT CCA AGC TCC 

1060 1070 1080 1090 1100 

369 SAKLLSKLDAAGS IEE 384 
TCA GCT AAG CTC CTG AGT AAG CTG GAT GCA GCC GGG TCG ATT GAA GAG 
1110 1120 1130 1140 1150 

40 385 I R K I KRLALNG* 396 

ATC AGA AAG ATC AAG CGC CTT GCA CTA AAT GGC TAA 
1160 1170 1180 
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ABSTRACT 



Nucleotide Sequences of the Nucleocapsid (NP) and Phosphoprotein (P) Genes of a 
Malaysian Velogenic Newcastle Disease Virus Strain AF2240 and the Production of 

the NP and P Proteins in Escherichia coli 



The present invention relates to nucleotide sequences encoding the nucleocapsid (NP) 
protein and phosphoprotein (P) of Newcastle disease virus (NDV) and the production of 
the corresponding proteins with recombinant plasmids bearing the nucleotide sequences in 
Escherichia coli. 
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Detection of NP protein with anti-NDV chicken serum 
lanes: 

1 : Molecular mass standards expressed in kDa 

2 & 3: NP fusion protein 

4 & 5: NP non- fusion protein 

6:NDV 



Figure 1 



+ 4 

34 





Detection of P fusion protein with the anti-iWyc monoclonal antibody 
lanes: 

1 : Cells containing the recombinant P fusion plasmid after being induced with IPTG for 5 h 
2: Cells containing the recombinant P fusion plasmid after being induced with IPTG for 3 h 
3: Cells containing the recombinant P fusion plasmid after being induced with IPTG for 1 h 
4: Cells containing the recombinant P fusion plasmid before being induced with IPTQ 
5: Cells harbouring empty vector 
6: Molecular mass standards expressed in kDa 



1 2 3 4 5 6 




Detection of P non-fusion protein with anti-NDV chicken serum 
lanes: 

1 : Molecular mass standards expressed in kDa 
2: Cells harbouring empty vector 

3: Cells containing the recombinant P non-fusion plasmid before being induced with IPTG 
4: Cells containing the recombinant P non-fusion plasmid after being induced with IPTG for 2 h 
5: Cells containing the recombinant P non-fusion plasmid after being induced with IPTG for 4 h 
6: Cells containing the recombinant P non-fusion plasmid after being induced with IPTG for 6 h 



Figure 2 



